A Two-Stage Ensemble of Diverse Models for Advertisement Ranking in KDD Cup 2012

Authors

  • Kuan-Wei Wu
  • Chun-Sung Ferng
  • Chia-Hua Ho
  • An-Chun Liang
  • Chun-Heng Huang
  • Wei-Yuan Shen
  • Jyun-Yu Jiang
  • Ming-Hao Yang
  • Ting-Wei Lin
  • Ching-Pei Lee
  • Perng-Hwa Kung
  • Chin-En Wang
  • Ting-Wei Ku
  • Chun-Yen Ho
  • Yi-Shu Tai
  • I-Kuei Chen
  • Wei-Lun Huang
  • Che-Ping Chou
  • Tse-Ju Lin
  • Han-Jay Yang
  • Yen-Kai Wang
  • Cheng-Te Li
  • Shou-De Lin
  • Hsuan-Tien Lin
Abstract

This paper describes the solution of National Taiwan University for track 2 of KDD Cup 2012, which aims to predict the click-through rate of ads on Tencent's proprietary search engine. We exploit classification, regression, ranking, and factorization models to utilize a variety of signatures captured from the dataset. We then blend the individual models in two stages to boost performance, the first on an internal validation set and the second on the external test set. Our solution achieves 0.8069 AUC on the public test set and 0.8089 AUC on the private test set.
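
As a rough illustration of the evaluation metric and of the score blending mentioned in the abstract, the sketch below computes AUC from pairwise comparisons and blends two models by averaging their rank-normalized scores. The function names (`auc`, `rank_normalize`, `blend`), the toy labels and scores, and the uniform-average blending rule are all assumptions made for illustration. This is not the authors' two-stage procedure, whose details are only given in the full paper.

```python
from typing import List, Optional, Sequence


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def rank_normalize(scores: Sequence[float]) -> List[float]:
    """Replace scores by their average ranks, scaled into [0, 1]."""
    n = len(scores)
    order = sorted(range(n), key=lambda i: scores[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        # Extend j over a run of tied scores so they share one average rank.
        while j + 1 < n and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2.0
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    denom = max(n - 1, 1)
    return [r / denom for r in ranks]


def blend(model_scores: Sequence[Sequence[float]],
          weights: Optional[Sequence[float]] = None) -> List[float]:
    """Weighted average of rank-normalized scores (hypothetical blending rule)."""
    if weights is None:
        weights = [1.0] * len(model_scores)
    normalized = [rank_normalize(s) for s in model_scores]
    total = sum(weights)
    n = len(normalized[0])
    return [
        sum(w * col[i] for w, col in zip(weights, normalized)) / total
        for i in range(n)
    ]


# Toy data (made up): 1 = clicked, 0 = not clicked.
labels = [1, 0, 1, 0, 0, 1]
model_a = [0.9, 0.8, 0.7, 0.2, 0.1, 0.3]
model_b = [0.4, 0.1, 0.2, 0.3, 0.05, 0.6]
blended = blend([model_a, model_b])

print("AUC model A:", auc(labels, model_a))
print("AUC model B:", auc(labels, model_b))
print("AUC blended:", auc(labels, blended))
```

Rank normalization is used here because AUC depends only on the ordering of scores, so it puts models with different output scales (for example, a regressor and a ranker) on a common footing before averaging.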

Related resources

Personalized Ranking for Non-Uniformly Sampled Items

We develop an adapted version of the Bayesian Personalized Ranking (BPR) optimization criterion (Rendle et al., 2009) that takes the non-uniform sampling of negative test items — as in track 2 of the KDD Cup 2011 — into account. Furthermore, we present a modified version of the generic BPR learning algorithm that maximizes the new criterion. We use it to train ranking matrix factorization model...

Combining Predictors for Recommending Music: the False Positives' approach to KDD Cup track 2

We describe our solution for the KDD Cup 2011 track 2 challenge. Our solution relies heavily on ensembling together diverse individual models for the prediction task, and achieved a final leaderboard misclassification rate of 3.8863%. This paper provides details on both the modeling and ensemble

Bayesian Personalized Ranking for Non-Uniformly Sampled Items

In this paper, we describe our approach to track 2 of the KDD Cup 2011. The task was to predict which 3 out of 6 candidate songs were positively rated – instead of not rated at all – by a user. The candidate items were not sampled uniformly, but according to their general popularity. We develop an adapted version of the Bayesian Personalized Ranking (BPR) optimization criterion [9] that takes t...

Novel Models and Ensemble Techniques to Discriminate Favorite Items from Unrated Ones for Personalized Music Recommendation

The track 2 problem in KDD Cup 2011 (music recommendation) is to discriminate between music tracks highly rated by a given user from those which are overall highly rated, but not rated by the given user. The training dataset consists of not only user rating history but also the taxonomic information of track, artist, album, and genre. This paper describes the solution of the National Taiwan Uni...

Feature Engineering and Ensemble Modeling for Paper Acceptance Rank Prediction

Measuring research impact and ranking academic achievement are important and challenging problems. Having an objective picture of research institutions is particularly valuable for students, parents and funding agencies, and also attracts attention from government and industry. KDD Cup 2016 proposes the paper acceptance rank prediction task, in which the participants are asked to rank the import...

Publication date: 2012